Quadratic Form (statistics)
   HOME

TheInfoList



OR:

In
multivariate statistics Multivariate statistics is a subdivision of statistics encompassing the simultaneous observation and analysis of more than one outcome variable. Multivariate statistics concerns understanding the different aims and background of each of the dif ...
, if \varepsilon is a
vector Vector most often refers to: *Euclidean vector, a quantity with a magnitude and a direction *Vector (epidemiology), an agent that carries and transmits an infectious pathogen into another living organism Vector may also refer to: Mathematic ...
of n random variables, and \Lambda is an n-dimensional
symmetric matrix In linear algebra, a symmetric matrix is a square matrix that is equal to its transpose. Formally, Because equal matrices have equal dimensions, only square matrices can be symmetric. The entries of a symmetric matrix are symmetric with ...
, then the scalar quantity \varepsilon^T\Lambda\varepsilon is known as a quadratic form in \varepsilon.


Expectation

It can be shown that :\operatorname\left varepsilon^T\Lambda\varepsilon\right\operatorname\left Lambda \Sigma\right+ \mu^T\Lambda\mu where \mu and \Sigma are the expected value and variance-covariance matrix of \varepsilon, respectively, and tr denotes the
trace Trace may refer to: Arts and entertainment Music * ''Trace'' (Son Volt album), 1995 * ''Trace'' (Died Pretty album), 1993 * Trace (band), a Dutch progressive rock band * ''The Trace'' (album) Other uses in arts and entertainment * ''Trace'' ...
of a matrix. This result only depends on the existence of \mu and \Sigma; in particular, normality of \varepsilon is ''not'' required. A book treatment of the topic of quadratic forms in random variables is that of Mathai and Provost.


Proof

Since the quadratic form is a scalar quantity, \varepsilon^T\Lambda\varepsilon = \operatorname(\varepsilon^T\Lambda\varepsilon). Next, by the cyclic property of the
trace Trace may refer to: Arts and entertainment Music * ''Trace'' (Son Volt album), 1995 * ''Trace'' (Died Pretty album), 1993 * Trace (band), a Dutch progressive rock band * ''The Trace'' (album) Other uses in arts and entertainment * ''Trace'' ...
operator, : \operatorname operatorname(\varepsilon^T\Lambda\varepsilon)= \operatorname operatorname(\Lambda\varepsilon\varepsilon^T) Since the trace operator is a linear combination of the components of the matrix, it therefore follows from the linearity of the expectation operator that : \operatorname operatorname(\Lambda\varepsilon\varepsilon^T)= \operatorname(\Lambda \operatorname(\varepsilon\varepsilon^T)). A standard property of variances then tells us that this is : \operatorname(\Lambda (\Sigma + \mu \mu^T)). Applying the cyclic property of the trace operator again, we get : \operatorname(\Lambda\Sigma) + \operatorname(\Lambda \mu \mu^T) = \operatorname(\Lambda\Sigma) + \operatorname(\mu^T\Lambda\mu) = \operatorname(\Lambda\Sigma) + \mu^T\Lambda\mu.


Variance in the Gaussian case

In general, the variance of a quadratic form depends greatly on the distribution of \varepsilon. However, if \varepsilon ''does'' follow a multivariate normal distribution, the variance of the quadratic form becomes particularly tractable. Assume for the moment that \Lambda is a symmetric matrix. Then, :\operatorname \left varepsilon^T\Lambda\varepsilon\right= 2\operatorname\left Lambda \Sigma\Lambda \Sigma\right+ 4\mu^T\Lambda\Sigma\Lambda\mu. In fact, this can be generalized to find the
covariance In probability theory and statistics, covariance is a measure of the joint variability of two random variables. If the greater values of one variable mainly correspond with the greater values of the other variable, and the same holds for the ...
between two quadratic forms on the same \varepsilon (once again, \Lambda_1 and \Lambda_2 must both be symmetric): :\operatorname\left varepsilon^T\Lambda_1\varepsilon,\varepsilon^T\Lambda_2\varepsilon\right2\operatorname\left Lambda _1\Sigma\Lambda_2 \Sigma\right+ 4\mu^T\Lambda_1\Sigma\Lambda_2\mu. In addition, a quadratic form such as this follows a generalized chi-squared distribution.


Computing the variance in the non-symmetric case

Some texts incorrectly state that the above variance or covariance results hold without requiring \Lambda to be symmetric. The case for general \Lambda can be derived by noting that :\varepsilon^T\Lambda^T\varepsilon=\varepsilon^T\Lambda\varepsilon so :\varepsilon^T\tilde\varepsilon=\varepsilon^T\left(\Lambda+\Lambda^T\right)\varepsilon/2 is a quadratic form in the symmetric matrix \tilde=\left(\Lambda+\Lambda^T\right)/2, so the mean and variance expressions are the same, provided \Lambda is replaced by \tilde therein.


Examples of quadratic forms

In the setting where one has a set of observations y and an operator matrix H, then the residual sum of squares can be written as a quadratic form in y: :\textrm=y^T(I-H)^T (I-H)y. For procedures where the matrix H is
symmetric Symmetry (from grc, συμμετρία "agreement in dimensions, due proportion, arrangement") in everyday language refers to a sense of harmonious and beautiful proportion and balance. In mathematics, "symmetry" has a more precise definiti ...
and
idempotent Idempotence (, ) is the property of certain operations in mathematics and computer science whereby they can be applied multiple times without changing the result beyond the initial application. The concept of idempotence arises in a number of pl ...
, and the errors are
Gaussian Carl Friedrich Gauss (1777–1855) is the eponym of all of the topics listed below. There are over 100 topics all named after this German mathematician and scientist, all in the fields of mathematics, physics, and astronomy. The English eponymo ...
with covariance matrix \sigma^2I, \textrm/\sigma^2 has a
chi-squared distribution In probability theory and statistics, the chi-squared distribution (also chi-square or \chi^2-distribution) with k degrees of freedom is the distribution of a sum of the squares of k independent standard normal random variables. The chi-squar ...
with k degrees of freedom and noncentrality parameter \lambda, where :k=\operatorname\left I-H)^T(I-H)\right/math> :\lambda=\mu^T(I-H)^T(I-H)\mu/2 may be found by matching the first two
central moment In probability theory and statistics, a central moment is a moment of a probability distribution of a random variable about the random variable's mean; that is, it is the expected value of a specified integer power of the deviation of the random ...
s of a noncentral chi-squared random variable to the expressions given in the first two sections. If Hy estimates \mu with no
bias Bias is a disproportionate weight ''in favor of'' or ''against'' an idea or thing, usually in a way that is closed-minded, prejudicial, or unfair. Biases can be innate or learned. People may develop biases for or against an individual, a group ...
, then the noncentrality \lambda is zero and \textrm/\sigma^2 follows a central chi-squared distribution.


See also

* Quadratic form * Covariance matrix *
Matrix representation of conic sections In mathematics, the matrix representation of conic sections permits the tools of linear algebra to be used in the study of conic sections. It provides easy ways to calculate a conic section's axis, vertices, tangents and the pole and polar relatio ...


References

{{DEFAULTSORT:Quadratic Form (Statistics) Statistical theory Quadratic forms